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Abstract. We study how the realignment criterion (also called computable cross- norm 
criterion) succeeds asymptotically in detecting whether random states are separable or 
entangled. We consider random states on C d ® C d obtained by partial tracing a Haar- 
distributed random pure state on C d <g> C d (g> C s over an ancilla space C s . We show 
that, for large d, the realignment criterion typically detects entanglement if and only if 

C*/ . • s ^ (8/37r) 2 <i 2 . In this sense, the realignment criterion is asymptotically weaker than the 

O I ' partial transposition criterion. 

S ■ 

Introduction 

A central problem in Quantum Information Theory is to decide whether a state is sep- 

£>• ! arable or entangled. Although this is a computationally hard task [9] , several operational 

criteria have been proposed to detect entanglement, such as the Peres partial transpose 

criterion (PPT) [15] and the realignment criterion due to Rudolph and Chen-Wu [61 [16] . 

In this paper, we focus on the realignment criterion, study its behaviour on large 
CH ! dimensional bipartite systems and compute the threshold for induced random states. 

These are random states on % = C d <S> C d which are obtained as the partial trace over 
an ancilla space C s of a Haar-distributed random pure state on 7i ® C s . We show that 
a threshold for the realignment criterion occurs at s = (8/37r) 2 <i 2 « 0.72<i 2 , in the 
following sense: if the ancilla dimension s is smaller that sq, with large probability the 
realignment criterion detects that the random state is entangled; when s is larger than 
So, the realignment criterion fails. Since the threshold for the PPT criterion is larger (it 
occurs at s = 4d 2 , see [2]), this means that the realignment criterion is asymptotically 
weaker that the Peres criterion. 

Here is a more striking illustration: take a random state uniformly distributed (with 
respect to the Lebesgue measure) on the convex body of all mixed states on C d (g> C d 
(this corresponds to s = d 2 , see [H]). Then, with probability tending to 1 as d tends to 
infinity, the following occurs: this state is entangled, and its entanglement is detected by 
the Peres criterion but not by the realignment criterion. 

Our proofs are based on a new model in Random Matrix Theory: realigned Wishart 
matrices. Let W be a Wishart random matrix (W = XX*, where X is a d 2 x s matrix 
with i.i.d. Gaussian entries). We study the realignment of the difference W — sld (which 
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is a non- Hermit ian matrix), and show that the asymptotic singular value distribution is 
given, under proper normalization, by a quarter circle law. 

The fact that the realignment criterion is generically asymptotically weaker that the 
Peres criterion is illustrated in a qualitative fashion when we focus on unbalanced bipar- 
tite systems. We consider induced random states on C dl Cg> C d2 (the dimension of the 
environment being still s). When d\ is fixed and d>2 tends to infinity, we show that the 
threshold for the realignment criterion is s = d\. This is to be compared with the cor- 
responding threshold for the Peres criterion, which is not bounded with respect to di (it 
was shown in [3] that this threshold is s = a(di)d2, with a(di) = 2di + 2^/d\ — 1). 

The paper is organized as follows: Section [T] introduces the background and states the 
main theorem about the threshold for the realignment criterion on C d <g> C d (Theorem 
II. ip . Section [2] introduces realigned Wishart matrices, and contains Theorem 12.11 about 
convergence to the quartercircle distribution. Section |3] contains a derivation of Theorem 
11.11 from Theorem 12.11 Section H] introduces the graphical calculus which is used in the 
proof of Theorem l2.ll In section[5]we compute the moments of realigned Wishart matrices, 
and the proof of Theorem 12.11 is completed in Section [6j Finally, Section [7] deals with 
unbalanced tensor products. 



1. Background and statement of the results 

1.1. Permutation criteria. We consider a bipartite Hilbert space H = C dl ® C d2 . Let 

{ e i}i=i an d {fj}jL\ be the canonical bases of C dl and C d2 . Any operator A onH admits 
a (double-indexed) matrix representation 

di d,2 

A = ^2^2 A ijM \ei <g> fj)(e k <g> fi\. 

i,k j,l 

For every permutation o of the indices {i,j,k,l}, we can introduce the corresponding 
reshuffling operation, which maps the matrix A = {Aij^i) to the (possibly non-square) 
matrix A a = (A a ^ a ^^ k ) a ^). This operation depends on the particular choices of bases. 

If p is a pure product state, then Hp^Hi = 1. Consequently, for any separable state p, we 
have ||p CT ||i ^ 1. Each permutation yields to an operational separability criterion. Such 
reshufflings were studied in [TT] , where it was proved that each of these 24 reshuffling is 
equivalent to one of the following 

(1) The trivial reshuffling, where ||^4 CT ||i = ||^4||i for every A. 

(2) The partial transposition [15], denoted A r , which corresponds to swapping the 
indices j and I. This operation is equivalently described as A r = (Id(B)T)A, where 
T is the usual transposition of d x d matrices. Note that for a state p, the condition 
||p r ||i ^ 1 is equivalent to p r ^ 0, and this is known as the PPT criterion (positive 
partial transpose). 



REALIGNING RANDOM STATES 3 

(3) The realignment [El [16], denoted A R , which corresponds to swapping the indices 
j and k. We have 

di 6,2 
i,k j,l 

The resulting matrix A R has dimension d\ x d\. The fact that a separable state 
p must satisfy Hp^Hi ^ 1 is called the realignment criterion or the computable 
cross-norm criterion. 

Except in Section [7J, we focus on the balanced case {d\ — d 2 — d). In this case the 
matrix A R is square; however note that the realignment of an Hermitian matrix does not 
produce a Hermitian matrix in general. Note also that Id = dE, where E — E^ is the 
maximally entangled state E = \ip)(ip\, with \ip) = 4= J2i=i l e * ® e *)- Obviously, one has 
also Id = (dE) R . 

It is known [16] that when d ^ 3, neither of the PPT or realignment criteria is stronger 
(one can find states which violate one criterion and satisfy the other one). In this paper, we 
show that when d is large, the PPT criterion is generically stronger that the realignment 
criterion. 

1.2. Random states. We consider the standard model of induced random states. These 
mixed states are obtained as partial traces (over some environment) of Haar-distributed 
random pure states. More precisely, we denote by /i„ jS the distribution of the state 

where ip is uniformly distributed on the unit sphere in C n <g> C s . In the following, we 
identify C n with C d <g> C d , for n = d 2 . 

When s ^ n, the probability measure /x njS has a density with respect to the Lebesgue 
measure on the set of states on C n which has a simple form [TH] 

(i) ^=fw = - L (dfltpr n , 

dvol Z n:S 

where Z n>s is a normalization factor. Note that formula ([T|) allows to define the measure 
/i„ jS for every real s ^ n, while the partial trace construction makes sense only for integer 
values of s. 

The dimension s of the environment can be thought of as a parameter. The resulting 
mixed state is more likely to be entangled when s is small. On the other hand, if s — > <x>, 
the resulting mixed state converges to the maximally mixed state, which is separable. 
Therefore, for any separability criterion, we expect a threshold phenomenon — a critical 
value between the range of s where the criterion is generically true and the range where it 
is generically false. Known results in this direction include, for a random state p G C d ®Q d 
with distribution (/,# s , 
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(1) For separability vs entanglement, the threshold is of order s ~ d 3 . More precisely, 
p is typically entangled when s ^ d 3 , and typically separable when s £3 d 3 log 2 d 

ffl. 

(2) For the PPT criterion, the threshold occurs precisely at s = Ad 2 : p is typically 
not-PPT when s < Ad 2 and typically PPT when s ^ 4d 2 0. 

(in this context, a property is called typical if the probability that it holds goes to 1 as d 
tends to infinity). 

We show that the threshold for the realignment criterion is precisely at s = (8/3n) 2 d 2 . 
This is our main theorem. 

Theorem 1.1. Denote'-/ = (8/37r) 2 « 0.72. For every e > 0, there exist positive constants 
c(e), C(e) such that the following holds. If p is a random state on C d ®C d with distribution 
Pd 2 ,s, then 

(1) Ifs ^ in-e)d 2 , then 

¥{\\p R \\i >1) > l-C(e)exp(-c(e)max(s,t/ 1/4 )). 

(2) Ifs ^ (-i + e)d 2 , then 

P(\\p R \\ 1 ^l)^l-C(e)ex V (-c(e)s). 

A comparison between the thresholds for the Peres and realignment criteria shows that 
the latter is generically asymptotically weaker. When the environment dimension s is 
between (8/37r) 2 d 2 and 4c? 2 , random states are non-PPT, but the realignment criterion 
fails to detect entanglement. As noted in the introduction, this range includes the special 
case s = d 2 , which corresponds to the uniform measure on the set of states (the density 
in equation ([T]) being constant). 

The theorem will follow from the description of the limiting distribution of singular 
values of p R , which are shown to converge towards a quarter-circle distribution. This 
result can be equivalently stated using Wishart matrices instead of random quantum 
states, which are more convenient from a random matrix theory perspective. We develop 
this approach in the following section. 

We also consider the case of unbalanced states, i.e. C dl ® C d2 with d\ fixed, and 
d 2 tending to infinity. In this asymptotic regime, we show that the threshold for the 
realignment criterion is exactly s = d\. Since the threshold occurs for a finite value of 
s, the realignment criterion is qualitatively weaker in the unbalanced case than the Peres 
criterion (see [3]). 

2. Spectral distribution of realigned Wishart matrices 

We describe here a new result from random matrix theory, which is the main ingredient 
in the proof of Theorem 11.11 Let X be a d 2 x s random matrix with i.i.d. A/"c(0, 1) 
(standard complex Gaussian) entries, and W = XX^ . The random matrix W is known 
as a Wishart random matrix of parameters (d 2 ,s). We consider W as an operator on 
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C d <g> C d , therefore we can consider the realignment of W, denoted R = W R . We may 
write Wjt t8 , Rd 2 ,si ■ ■ ■ instead of W, R, . . . if we want to make dimensions explicit. 

It turns out that to be simpler to study the difference W — sld rather than W itself, 
or equivalently to study the operator R — dsE (recall that Id = dE). The following 
theorem (proved in Section |BJ) describes the asymptotic behaviour of the singular values 
of R — dsE. We denote by Cat p := — r[( 2p ) the pth Catalan number. 

Recall that the (standard) quarter-circle distribution is the probability measure 



y/A^ 



x 



2 



fi qc = 1 [0,2] 0*0 dx. 



TV 



The even moments of this measure are given by the Catalan numbers: 

^Vrln (rr\ — Ha, 



J X d[ln C \X J — \_jQXj 

Jo 



whereas the odd moments are 



2 2p+1 2^ 5 p\(p + 2)\ 

x p+ dfi qc {x) = . 

7r(2p + 4)! 



In particular, the average of \i qc is 8/(37r). 



Theorem 2.1. For every integers s,d, let R^ s be the realignment of a Wishart matrix 
W(p >s , and Q = Qd 2 , s = -jj^{RtP,s ~ dsE). Then, when d and s tend to infinity, 

1 

d,s->oo d? 



lim E — Ti[(QQ*) p ] =Cat p , 



lim Var-J-Tr[(gg*) p ] = 0. 

We emphasize that there is no assumption about the relative growth of s and d, besides 
the fact that they both tend to infinity. 

Theorem 12.11 can be restated as follows: the empirical singular value distribution of Q, 
defined as -^ Yui=i a i(Q)i converges in moments towards a quarter-circle distribution ji qc . 
Via standard arguments (see e.g. [2] for a sketch), it follows that for every continuous 
function / : R + — > R with polynomial growth, we have (convergence in probability) 

(2) ^Tr/dg^l) A / fdfx sc . 
When applied to the function f(x) = x, equation ([2} yields 

(3) ||Q d vlli~^ 

p 

(by X ~ Y we mean that the ratio X/Y converges in probability towards 1). 
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3. From Wishart matrices to random states: proof of Theorem 11.11 

In this section we show how to derive Theorem 1 1 . 1 1 from the results on Wishart matrices. 
Induced random states are closely related to Wishart matrices. Indeed, it is well known 
that if W — W<p }S is a Wishart matrix, then p := (Triy)" 1 !^ is a random state with 
distribution Pd 2 ,s- 

We are going to prove first a weak form of the theorem, where we show that the 
threshold is of order d 2 , without identifying the exact constant 7 = (8/37r) 2 . Here is the 
relevant proposition; the proof will use only the expansion of Tt(QQ*) p for p — 1 and 
p = 2, and will have the advantage to work for all values of s (including s <C d 2 or s 3> d 2 , 
which require special attention). 

Proposition 3.1. There exist absolute constants c,C,cq and Co such that, for a random 
state p on C d <S> C d with distribution Pd 2 ,s, the following holds 

(1) If 8 ^ c rf 2 ; then P(||p fi ||i > 1) > 1 -Cexp(-cd 1 / 4 ). 

(2) If s ^ C^d 2 , then P(||p fi ||i < 1) ^ 1 - Cexp(-cc/ 1 /4). 

Assume that p is defined by the equation p := {TiW)^ 1 W , where W is a Wishart 
random matrix with parameters (d 2 , s). Define a by Tr W = (1 + a)d 2 s. The matrix Q 
from Theorem 12. II is related to p by the equation -jj^Q = (1 + a)p R — E/d. In particular, 



(4) 



( WQWi _A < uru ^ J_ (\\Qh + A 

\ dy/s d) ' 1 + a \ d^/s d) 



1 + a \ dy/s d / 

The random variable Tr W follows a x 2 distribution, and the next lemma implies that 
with large probability, \a\ ^ l/(dy/s). 

Lemma 3.2 (see e.g. [13], Lemma 1). IfW = Ws.s is a Wishart matrix, then for every 
<e < 1, 

Tr W \ „ , 9 , 

— 1 > e J < 2exp {-ce 2 d 2 s ). 

d 2 s J 

Proof of Proposition fXTl It is possible to estimate ||Q||i from the knowledge of HQH2 and 
|| Q || 4 , using the following inequalities 

IIOII 3 
(5) |^<HQ||i<«2. 

Since HQlli = Tr(QQ*) and \\Q\\\ = Tr(QQ*QQ*) are polynomials in the matrix entries, 
they are easier to analyze. Here is a proposition which can be derived from the analysis 
in Section E] — more precisely see equations ( TTTj) . (TT2"j) and (IT3|) . 

Proposition 3.3. There are absolute constants c, C such that the following inequalities 
hold for every d and s 

cd 2 ^ E IIQH 2 ^ Cd 2 , cd 2 SC E \\Q\\t ^ Cd 2 , 
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Var IIQH1 ^Cd 2 , Var||Q||^ Cd 2 

We now use a general concentration inequality for polynomials in Gaussian variables. 

Proposition 3.4 (see [12] , Theorem 6.7). Let (Gi) be independent Gaussian variables, and 
P be a polynomial of (total) degree q. Consider the random variable Y = P(Gi, . . . , G n ). 
Then for every t > 0, 



-C q t 



2An 



P(|Y-EY| ^tVVarY) ^C,exp( 

(C q and c q being constants depending only on q). 

Applied to the polynomials ||Q||| and \\Q\\\ (of degree respectively 4 and 8), we obtain, 
that with large probability, both and ||Q||| and \\Q\\\ are of order d 2 (up to universal 
constant). Together with (JSJ), this yields that, for some absolute constants c,C 

P{cd 2 < IIQUi < Cd 2 ) ^ 1 - Cexp(-cd 1/4 ) 

we obtain that with probability larger than 1 — Cexp(— cd 1 ' 4 ), 

1 f cd 1\ . „ „„ . 1 (Cd v 
d 



(6) 



From ([6]) and 



^ 



\P Hi 



1 + a \y/s 
and Proposition 13.11 follows from Lemma 13.21 



^ 



1 + a \y/s d 



□ 



We denote by 1Z the (convex) set of states which are not detected to be entangled by 
the realignment criterion 

K = K{U) = {p state on U : \\p R \\x ^ 1}. 



We also introduce the gauge 
state p as 

\\p\\n 



\k associated to the convex body 1Z. It is defined for any 



Id 1/ Id . 



inf U > 



E 



E 



d t \ d 



<: 1 



Lemma 3.5. The following inequalities hold for any state p on C d 

d ,, D „ , .,, ,, ,, d , 



C d , 



IpX-E/dh^Wpl 



R 



^ 



\P 



E/d\\ : 



d+l ur ' " " r " d- 1 

Proof. Let A = \\p\\ii- Then ||^ + j (p R — -|) || = 1, and by the triangle inequality 



1 1 
1_ d ^ A 



r E 

" -j 



< 1 + 5 



and the result follows. 
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We are going to use a result from p], which gives a concentration inequality for the 
gauge of a random state (note that the inradius of 1Z equals the inradius of the set of 
separable states, which is 1/ \/d 2 (d 2 — 1), see [ID]). We obtain 

Proposition 3.6 QlJ, Proposition 4.2). For every constant Cq > 0, there are constants 
c, C such that the following hold. Let s ^ c^d 2 , and p be a random state on C d <8> C d with 
distribution Pd 2 ,s, then 

P(|||p|| w - Af| >rj) ^ C exp(-cs) + C exp(-csr/ 2 ), 

where M = M<p s is the median of the random variable \\p\\n- 

Note that Proposition 4.2 in [1] is stated with the restriction s ^ d 2 ; however it can 
be checked that the proof extends to the range s ^ c^d 2 for any c$ > 0, altering only the 
values of c and C. We can now pass to the complete proof of Theorem ll.l[ 



Proof of Theorem \l.l\ Let us show the first part of the statement, the second part being 
similar. Since the case when s ^ c^d 2 was covered by Proposition I3.1[ we may assume 
s ^ c d 2 . 

Denote by ffd 2 ^ the probability that a random state on C d ® C d , with distribution 
Pd 2 ,s, belongs to the set TZ. Fix s > 0. For each d, let s = Sd be the number with 
Cod 2 ^ s ^ (1 — e) , ~fd 2 such that 7T<22 jS is maximal. We claim that 

1 

liminf M d 2 ^ 



d-^CG " " ' yj\ — £ 

Indeed, by Lemma [3. 5 [ the random variables \\p\\n and \\p R — E/D\\i have asymptoti- 
cally equivalent medians. Moreover, by (j4]), it suffices to show that 

liminf Median I ^fc] > 



d^co \ d\fs J y/l — e' 

and this last inequality follows immediately from (EJ). 

Choose some r\ such that < rj < —J= 1. Applying Proposition 13. 6[ we obtain, for 

d large enough, 

7T d 2 yS = P(peTZ)^P (\\p\\ n < M d 2, Sd -v)< C(e) exp(-c(e)s d ). 

Small values of d are taken into account by adjusting the constants if necessary. 

For the second part of the theorem, consider the number s = s' d with s ^ (1 — e) , ~fd 2 
such that 7id 2 ,s is minimal. This number is well-defined since for fixed d, the sequence 
7r rf 2 s converges to 1 as s tends to infinity (the measures pd 2 ,s converge towards the Dirac 
mass at the maximally mixed state). The rest of the proof is similar. □ 
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4. Background on combinatorics of non-crossing partitions and 

graphical calculus 

Let us first recall a number of results from the combinatorial theory of noncrossing 
partitions; see [H] for a detailed presentation of the theory. For a permutation a G S p , 
we introduce the following standard notation: 

• #cr is the number of cycles of cr; 

• | cr | is its length, defined as the minimal number k such that a can be written as 
a product of k transpositions. The function (a, ix) — > |o" _1 7r| defines a distance on 
S p . One has #a + \a\ = p. 

Let £ G S p be the canonical full cycle £ = (12- --p). The set of permutations a G S p 
which saturate the triangular inequality \a\ + |o" _1 ^| = |£| = p — 1 is in bijection with 
the set NC(p) of noncrossing partitions of [p] := {1, . . . ,p}. We call such permutations 
geodesic and we shall not distinguish between a non crossing partition and its associated 
geodesic permutation. We also recall a well known bijection between NC(p) and the set 
NC 2 (2p) of noncrossing pairings of 2p elements. To a noncrossing partition tc G NC(p) 
we associate an element fat(7r) G NC 2 {2p) as follows: for each block {i ly i 2 , . . . ,i k } of 
it, we add the pairings {2i 1 — 1, 2i k }, {2ii, 2i 2 — 1}, {2i 2 , 2i 3 — 1}, ... , {2i k _ 1 , 2i k — 1} to 
fat(7r). The inverse operation is given by collapsing the elements 2i — 1, 2i G {1, . . . , 2p} 
to a single element i G {1, . . . ,p}. 

In the rest of the paper we shall perform moment computations for random matrices 
with Gaussian entries. The main tool we use is the Wick formula (see e.g. [18] for a 
proof) 

Lemma 4.1. Let X%, . . . , X k be jointly Gaussian centered random variables. If k = 2p+ 1 
then ELY! ■■■X k }=0. If k = 2p then 

(7) E[X 1 ...X k }= J2 1] E MJ 

K={{h,jl},-,{ip,jp}} m=1 
pairing o/{l,...,fe} 

The above formula is very useful when applied to moments of Gaussian random matri- 
ces. Moreover, a graphical formalism adapted to random matrices was developed in [8] in 
order to facilitate its application. This graphical calculus is similar to the one introduced 
in [7] for unitary integrals and the corresponding Weingarten formula. We present next 
the basic ideas of the Gaussian graphical calculus and we refer the interested reader to 
[S] for the details. 

In the Gaussian graphical calculus tensors (and, in particular, matrices) are represented 
by boxes. In order to specify the vector space a tensor belongs to, boxes are decorated 
with differently shaped symbols, where each symbol corresponds to a vector space. The 
symbols are empty (white) or filled (black), corresponding to primal or dual spaces. Tensor 
contractions are represented graphically by wires connecting these symbols. A wire should 
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q 



x 



'* X u 

■ A b 



paired boxes 
Figure 1 . Pairing of boxes in the Gaussian graphical calculus 

always connect two symbols of the same shape (corresponding thus to vector spaces of the 
same dimension). A wire connecting an empty symbol with a filled symbol of the same 
shape corresponds to the canonical map C n (g) (C n )* — > C. However, we shall allow wires 
connecting two white or black symbols, by identifying non-isomorphically a vector space 
with its dual. Finally, a diagram is a collection of such boxes and wires and corresponds 
to an element in a tensor product space. 

The main advantage of such a representation is that it provides an efficient way of 
computing expectation values of such tensors when some (or all) of the boxes are random 
matrices with i.i.d. Gaussian entries. Indeed, there exists an efficient way of implementing 
the Wick formula in Lemma I4TT1 When the entries of the Gaussian matrices have standard 
normal distributions, the covariances in equation ([7]) are just delta functions. We state 
now the graphical Wick formula from [8]. 

Theorem 4.2. Let T> a diagram containing p boxes X and p boxes X which correspond 
to random matrices with i.i.d. standard Gaussian entries. Then 



E X [D] 



aeS„ 



V r 



where the diagram T> a is constructed as follows. One starts by erasing the boxes X and X , 
but keeps the symbols attached to these boxes. Then, the decorations (white and black) of 
the i-th X box are paired with the decorations of the a{i)-th X box in a coherent manner, 
see Figure [IJ In this way, we obtain a new diagram T> a which does not contain any X 
or X boxes. The resulting diagrams T> a may contain loops, which correspond to scalars; 
these scalars are equal to the dimension of the vector space associated to the decorations. 

We now present a simple example of moment computation that makes use of the Gauss- 
ian graphical calculus. Let W G MdiC) be a complex Wishart matrix of parameters (d, s), 
that is W = XX* with X e M^ XS (C) with i.i.d. standard complex Gaussian entries. The 
diagrams for the first and the second moment of W are presented in Figure [2j Since these 
diagrams contain only Gaussian boxes, the resulting expanded diagrams T> a will contain 
only loops, so they will be scalars. For the first moment, the diagram contains only one 
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X 3 c X 




X 



Figure 2. Diagrams for the first two moments of a Wishart matrix. Round 
symbols correspond to C d and diamond symbols correspond to C s . 



*>(U 




> i 
i 




ds 



Figure 3. Graphical expansion for the first moment of a Wishart matrix. 
There is only one term in the sum, corresponding to the unique permutation 
on one element. 

pair of Gaussian matrices X/X, hence the expected value of the trace is given by the 
following one term sum (see Figure [3]) 



ETiW = ^V a = ds. 



For the second moment, there are two pairs of Gaussian boxes, thus the formula in 
Theorem 14.21 contains two terms, see Figure HJ 

ETr W 2 = J2 V * = P (i)(2) + %2) = ds 2 + d 2 s. 

a£S2 



5. Moment formula for the singular values of a realigned Wishart 

MATRIX 



In this section we deduce a formula for realigned Wishart matrices, using Theorem 14.21 
We are going to work in the more general setting of unbalanced tensor products. We 
consider Wishart matrices W G M dl (C) (g> M d2 (C) of parameters (d±d 2 , s), i.e. W = XX* 
with X G M dld2XS (C) having i.i.d. A/c(0, 1) entries. 

Let R = W R e M d 2 xd 2(C) the realigned version of W, that is 

Rij,kl = Wikjl- 
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£>(1)(2) = 




= ds 2 



P (12) = 




X3 Os 




= d 2 



Figure 4. Graphical expansion for the second moment of a Wishart ma- 
trix. There are two terms in the sum, corresponding to the identity permu- 
tation (1)(2) and to the transposition (12). 
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Figure 5. The diagram of a realigned Wishart matrix. Square symbols 
correspond to C dl , round symbols correspond to C d2 and diamond-shaped 
labels correspond to C s . 

The diagram of the matrix R is presented in Figure [5j 
Proposition 5.1. The moments of the random matrix RR* are given by 
(8) ETr [(RR*f] 



J2 s^df^d*^, 



where the permutations 7, 5 e S 2p are given by 

7 =(12)(34)---(2p-l,2p) 
«J=(l,2p)(23)(45)...(2p-2,2p-l). 

Proof. In FigureElwe represent the matrix i?i?* and its p-th moment. The second diagram 
contains 2p W boxes, each W box being represented in Figure We use Theorem 14.21 to 
compute the average 

ETr[(RR*y}= J2 V ^ 

a£S2 p 
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Tr [(RR*) 




Figure 6. The diagrams of the product RR* and of the p-th moment of RR*. 

where D a is the diagram obtained after the removal of the X and X* boxes and connecting 
the i-th X box with the a(i)-th X* box. Since the random boxes are the only tensors 
appearing in the diagram, each D a will contain only loops that can be counted in the 
following way: 

(1) There are j^a loops coming from squared labels (associated to C s ), because the 
initial wiring of these labels is given by the identity permutation. 

(2) There are ^(ck7 _1 ) loops coming from the lower round labels (associated to C d ' 2 ). 
The initial wiring of these labels is given by the permutation 

7 = (12)(34)...(2p-l,2p). 

(3) There are #(a<5 _1 ) loops coming from the upper square labels (associated to C dl ). 
The initial wiring of these labels is given by the permutation 

5=(l,2p)(23)(45)---(2p-2,2p-l). 

Taking into account all contributions, we obtain the announced moment formula for RR*. 

D 



6. Proof of Theorem 12.11 



Recall that in Theorem 12.11 we are considering the balanced case, d\ = d 2 = d. Note 
that 

QQ* = d- 2 s-\RR* - dsRE d - dsE d R* + d 2 s 2 E d ), 

so that one can expand the p-th moment as 

v 
ETr[(QQ*) p ]=d- 2p s- p J^ (-l) E » (/lW+/2(l)) ETr JJ (F 1 (i)F 2 (i)) , 

/l,2:[p]-K0,l} i=1 
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dsRE d 


• 

= ds 

• 






c 
c 


RR* 

a{2i) = 2% 




2i- 1 



2i 



Figure 7. The diagrams of dsRE^ and of RR*, with the constraint a(2i) 
2i are the same. 



where 



and 



m 



R when fi{i) — 

dsEd when fi{i) = 1 



Fo(i) 



R* when f 2 (i) = 0, 

dsEd when i^) = 1- 

We are going to use now a trick that will allow us to compute the expected value in 
the general term above in the same manner as we did for RR* in Proposition 15.11 The 
idea, presented graphically in Figure U\ is that when one uses the graphical expansion 
formula for the expected value above, it is as if we had only RR* terms, but the set of 
permutations we allow is restricted to 

S2M1J2) = {ae S 2p \Vi E fr\l), a(2i - 1) =2i- 1 and V* E f 2 \l), a{2i) = 2i}. 
Using the result in Proposition 15. H we obtain 

ETT[(QQ*) I >]=d- 2p s- p J2 (-l)^ (/l(i)+/2(l)) Yl s# a d#^^d#^ a5 ^ 

/l,2:[p]-H0,l} aeS 2p (fiJ 2 ) 

= d~ 2p s~ p y^ s # a d#( Q ^~ i )d#( a ' 5 " 1 ) y^ (_i)Ei(/i«+/2«)_ 

Note that in the last equality we inverted the summation order, so we had to restrict the 
set of choice functions /i^ to the family 

?(a) = {(A, f 2 ) I a(2i - 1) ^ 2z - 1 => /i(z) = and a(2z) ^ 2z => / 2 (z) = 0}. 
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Let us show now that whenever a permutation a G S 2p has a fixed point io, the corre- 
sponding sum 

(9) y^ (_i)£i(/i(0+M0) 

(/i,/2)e.F(a) 

equals zero. For such a permutation and an element / = (j\, f 2 ) G ^"(o;), define another 
pair / = (/1, / 2 ) as follows. If io is odd, io = 2j — 1, then put f 2 = f 2 and 

fC\ = \ 1 -hU) if J = Jo, 
{3> \hU) if 3* Jo- 

For even z = 2j , define /1 = /1 and 

s/.x = f 1 -/2O') if J = fa> 

/2UJ \/ 2 (j) if 3*3* 

Since z'o is a fixed point of a, we have / = (/1, ^2) G J 7 (en) and the map / — > f is thus 
an involution without fixed points acting on F(a). Notice also that changing a single 
value in a pair (fi,f 2 ) changes the parity of the sum ^j(/i(i) + /ijfa"))- This concludes 
our argument that the sum ([9]) is null whenever a has a fixed point. 

We have thus shown that permutations with fixed points cancel each other out, so we 
have 

ETr [{QQ*f\ = d- 2p s- p J2 s* a d#^~^d#^ 5 '^ J2 (-l) E * (/lW+/2W) , 

°e-s 2 ° p (/i,/2)e^(a) 

where we denote by S 2p the set of permutations of [2p] without fixed points. For such a 
permutation a, the set of admissible choices J- {at) contains only one element / = (/1, f 2 ) 
with fi(i) = f2{i) — in such a way that the above formula simplifies to 

(10) ETr[(QQ*) p ]= ^ d~ 2p s~ v s* a d* {a ^ 1] d*^ 5 '^ . 

For small values of p, we obtain 

(11) ETr[QQ*] = d 2 } 

(12) E Tr [{QQ*) 2 } = 2d 2 + 2s~ 1 d 2 + 1 + 4s" 1 . 

We show next that the dominating term in the sum above is of the order d 2 and that 
it is given by permutations a which are non-crossing pair partitions of [2p\. Since there 
are Cat p such permutations, we obtain 

(13) ETr[(QQ*) p ] = d 2 Cat p (l + o(l)), 
which is the moment formula we aimed for. 
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Let us consider separately the exponents of d and s in the general term of the sum ( ITU1) . 

g(a) = -2p + #(a 7 " 1 ) + #(a^ 1 ) = 2p - (\aT X \ + M -1 !) 
h(a) = —p + #ct = p — \a\. 

Using the the triangular inequality and the fact that a has no fixed points, we obtain 

|o7 -1 | + |a<* -1 | > |7<J| = | (2p - 1 2p - 3 - ■ • 531) (246 • • - 2p) | = 2p - 2, 
|a| >p, 

which shows that we have indeed g(a) ^ 2 and h(a) ^ 0. In order to conclude, it 
remains to be shown that the permutations which saturate both inequalities are exactly 
the non-crossing pair partitions of [2p\. The fact that a has no fixed points and that 
|a| = p implies that a is indeed a product of p disjoint transpositions i.e. a pair partition. 
To show that it is non-crossing, we start from the geodesic condition |a7~ x | + |a5 _1 | = 
\jS\ = \(2p — 1 2p — 3 • • -531)(246 • • -2p)\. This implies that the permutation 7a lies 
on the geodesic id — > jS. More precisely, we can write a = 7il o n e where n o and IT e are 
permutations acting on the odd, respectively even elements of [2p] . The geodesic condition 
implies that these permutations come from non-crossing partitions 7r ,7r e e NC(p): 

n o (2z-l) = 27r- 1 (z)-l, 

n o (2«) = 2z, 

n e (2i - 1) = 2% - 1, 

n e (2z) = 2n e {t). 

The fact that a is an involution easily implies ir e = tt =: 71 G NC(p) so that the action 
of a is given by 

a(2i- 1) = 2n-\i), 
a{2i) = 2tt(z) - 1. 

This is equivalent to a = fat(7r) so that a is necessarily a non-crossing pair partition. 



Let us now prove the second statement in the theorem, by giving an estimate on the 
second moment of the random variable Tr \{QQ*) P }- 
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Using Theorem 14 .21 for the diagram of E Tr 2 [{QQ*) P ] , which is made of two disconnected 
copies of the bottom diagram in Figure O we obtain the moment expansion 

ETr 2 [{QQ*) P ] = d- 4p s- 2p J2 (-l)^ (/l(i)+/2W) - 

/l, 2 :[2p]-K0,l} 

/ V \ / 2p 

•E Tr[](fi(W)) Tr J] (F 1 (i)F 2 (i)) 

\i=l / V i=P+l 

= d~ 4p s~ 2p y^ s * a d* {ai ^ ) d* {aS ^ ) Y^ (_i)Ei(/i«+/2«)_ 

"£5 4p (/i,/2)e/(o) 

where the F and / functions have the same meaning as before and the permutations 
7i2,<5i2 are defined by 

712 = (12)(34) • ■ • (2p - 1, 2p)(2p + 1, 2p + 2) • ■ • (4p - 1, 4p) 

5i2 = (1, 2p)(23)(45) • • • (2p - 2, 2p - l)(2p + 1, 4p)(2p + 2, 2p + 3) • • • (4p - 2, 4p - 1). 

We can show, by the same technique as before, that permutations a with fixed points 
cancel each other out in the sum above, so we have 

ETr 2 l(QQ*) p ] = d- 4p s~ 2p J2 s* a d*( a ^d*( aS ^ 

We investigate next the dominating term in the sum above. The exponents of d and s 
in the general term read 

012(a) = -4p + #(«7r 2 1 ) + #(«^r 2 1 ) = %> - (i«7r 2 1 l + l^l) 

^12(0) = — 2p + #a = 2p — \a\. 
Using the the triangular inequality and the fact that a has no fixed points, we obtain 

Wli2\ + \ aS n\ > |7iA 2 | = 4p - 4, 
|a| ^ 2p, 

which shows that we have 012(0;) ^ 4 and /112(a) ^ 0. This proves that the sum behaves 
like d A times a constant. To evaluate this constant, we need to find the set of permutations 
a which saturate the above inequalities. As before, this set is made of pair partitions such 
that 7i 2 a lies on the geodesic id — > 712^12- Recall that both permutations 712 and 612 
have a product structure 

712 = 7i72 
5n = 5i5 2 , 

where 71,^1 act on [2p] and 72,(^2 act on 2p + [2p\. Hence, mimicking the reasoning in 
the first part of the proof, the permutations we want are those having also a product 
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structure a = ct\a.i, where a.\ and a% are non-crossing pair partitions of [2p] and 2p + [2p] 
respectively. Since one can choose a.\ and «2 independently, we conclude that 

ETr 2 [(Qg*) p ] = rf 4 (Cat; + o(l)), 

which, together with ( Tl3|) . achieves the proof of the theorem. 

Note that the function (712(a) takes only even values, and therefore permutations a 
such that (712(a) < 4 must actually satisfy g^ipt) ^ 2. This remark yields a bound on the 
variance 



(14) VarTr[(QQ*)^C p d 2 , 

where C p is a constant depending only on p. 

7. Realigning states in an unbalanced tensor product 

We analyze now an unbalanced tensor product C dl <8> C d2 , with d\ < d^- We consider 
the asymptotic regime where d\ is fixed and g?2 — > 00 and we show that the threshold 
occurs at a finite value of the parameter s, more precisely s = d\. 

Theorem 7.1. For every integers d\, s, there are constants C = C(d\, s) and c = c(di, s) 
such as the following holds. Let p be a random state on C dl ®C d2 with distribution n^a^s, 
then 



(1) If s < d\, then 

(2) Ifs > d\, then 



1/^ 



P(||p«||>l)^l-L7exp(-< 4 ). 



\p R \ < 1) ^ l-C7exp(-c4 /4 ) 



As in the balanced case, the result is based on a moment computation for a realigned 
Wishart matrix. 

Theorem 7.2. In the regime of fixed d±, s and g?2 — > 00 ; the empirical singular value dis- 
tribution of d 2 l R converges in moments to a Dirac mass at yfs. Moreover, the variances 
of the moments of the random matrix d 2 x R satisfy 

VarTr [{d 2 2 RR*f] = 0{l/d\). 

Proof. We need to prove that for every integer p, 

lim ^jETr \(dn 2 RR*) p ] = s p/2 . 

As before, we start from the moment formula (jHJ) 

ETr[(RR*Y]= J2 s^df^'^d*^'^. 
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Since the only parameter growing to infinity in the above sum is d 2 , the dominating term 
is given by the permutation = 7 and thus 

(15) ETi{(RR*) p ]~dl P s p dl 

showing the convergence in moments of the empirical singular value distribution. 
The statement about the variance follows readily from the following formula 

ETr 2 [(RR*)P}= J2 s^dt^^d*^ 5 ^. 

We note that the dominating term is given by a = 712 and that it cancels out with the 
square of the right hand side of equation (IT5"j) . The largest remaining terms correspond 
to permutations a with #(«7f 2 1 ) = 4p — 2. □ 

We now move on to the proof of Theorem 17. 1[ which mimics the one of Proposition 
13.11 This approach gives the exact threshold in the unbalanced case because the limiting 
measure is a Dirac mass, so the bounds given by the moments 2 and 4 are already tight. 

Proof of Theorem \7.1\ In the present case, equation (J5J) reads 

II Rll 3 

NfHlj ^ II DM <- J || DM 

j- p. ^ \\R\\t ^ di||-K|| 2 . 

Using the previous proposition and the concentration for Gaussian polynomials (lemma 
13.51) . we can find constants c, C such that for every t > 0, 

P{\\\R\\l - d 2 2 d 2 lS {l + o{l))\ >Ctd 2 ) ^Cexp(-ct 1/2 ) 

P(|||i2||l-^d?s 2 (l + o(l))| >Ctd 2 ) ^Cexp(-ct 1/4 ). 

We choose t = rjd 2 for some 77 > 0. The previous two facts imply that with large 
probability, ||(did2s)~ 1 -ft||i is close to di/y/s (the difference being smaller than any fixed 
e > 0, for an appropriate choice of rj). Finally, one can replace R/(did 2 s) by p R , by using 
the fact that the trace of the Wishart matrix W concentrates around its mean d\d 2 s (see 
Lemma E2D. □ 
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